Extracting Information from Archaeological Texts
نویسنده
چکیده
To address archaeology’s most pressing substantive challenges, researchers must discover, access, and extract information contained in the reports and articles that codify so much of archaeology’s knowledge. These efforts will require application of existing and emerging natural language processing technologies to extensive digital corpora. Automated classification can enable development of metadata needed for the discovery of relevant documents. Although it is even more technically challenging, automated extraction of and reasoning with information from texts can provide urgently needed access to contextualized information within documents. Effective automated translation is needed for scholars to benefit from research published in other languages.
منابع مشابه
Presenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملClustering Indus Texts using K-means
One of the most important undeciphered scripts of the ancient world is the Indus script. Earlier studies had focused on the correlations between signs in the Indus texts using various statistical and computational techniques such as N-grams or Markov chains. In the present study, K-means clustering, an unsupervised machine learning technique is used to identify clusters of similar texts without...
متن کاملOntologies and Information Extraction
An ontology is a description of conceptual knowledge organized in a computerbased representation while information extraction (IE) is a method for analyzing texts expressing facts in natural language and extracting relevant pieces of information from these texts. IE and ontologies are involved in two main and related tasks, • Ontology is used for Information Extraction: IE needs ontologies as p...
متن کاملImproving Satellite Quickbird-based Identification of Landscape Archaeological Features through Tasseled Cap Transformation and Pca
In this paper, the Tasseled Cap Transformation (TCT) was applied to QuickBird multispectral images for extracting archaeological features linked to ancient human transformations of the landscape. The investigation was performed on Metaponto, one of the most important archaeological sites in the South of Italy. The analysis was focused on the identification of ancient land divisions likely relat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015